Abstract

Gilbert Strang posited [7] that a permutation matrix of bandwidth w can be written as a
product of N < 2w permutation matrices of bandwidth 1. A proof employing a greedy "parallel
bubblesort" algorithm on the rows of the permutation matrix is detailed and further points of
interest are elaborated.

1 Conjecture and Outline 

This section states the problem, starting with some necessary definitions. As a convention, M' will
denote the transpose of a matrix M and I is the identity matrix.

Definition 1. An n x n permutation matrix P contains only 0s and 1s, with only one 1 per row and
column. The permutation of a column vector x, where x' = [1 2 ... n], is the column vector Px.

A matrix M = [m_ij] is said to be of bandwidth w, denoted by band(M) := w, if m_ij = 0 whenever
|j - i| > w.

The value of band(M) is 0 if M is diagonal, 1 if M is tridiagonal and 2 if M is pentadiagonal.
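As a small illustration of Definition 1, the following sketch computes the bandwidth of a matrix stored as a list of rows. The function names are hypothetical, and the sketch assumes that band(M) means the smallest w satisfying the definition.

```python
def is_permutation_matrix(M):
    """True if every row and every column of the square 0/1 matrix M holds exactly one 1."""
    n = len(M)
    unit = [0] * (n - 1) + [1]
    return all(sorted(r) == unit for r in M) and all(sorted(c) == unit for c in zip(*M))


def bandwidth(M):
    """Smallest w with M[i][j] == 0 whenever |j - i| > w (0 for a zero matrix)."""
    n = len(M)
    return max((abs(j - i) for i in range(n) for j in range(n) if M[i][j]), default=0)


identity = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]   # diagonal
swap12 = [[0, 1, 0], [1, 0, 0], [0, 0, 1]]     # tridiagonal
cyclic = [[0, 0, 1], [1, 0, 0], [0, 1, 0]]     # pentadiagonal

print(bandwidth(identity), bandwidth(swap12), bandwidth(cyclic))
```

The three sample matrices match the diagonal, tridiagonal and pentadiagonal cases named above.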
Gilbert Strang posed the following conjecture [7]: 

Conjecture 1 (Strang). A finite permutation matrix of bandwidth w > 0 can be written as the product
of at most 2w - 1 bandwidth-1 permutation matrices.



Example 2. [matrices omitted] P = R_1 R_2.



This paper aims to prove Conjecture 1 and explore topics opened during the development of this
proof. The next three sections (Sections 2, 3 and 4) will cover the proof of the conjecture, outlining a
greedy "parallel bubblesort" [7] strategy to determine a factor per iteration, starting from a specific key
class of permutation matrices through progressively larger classes of permutation matrices. Section 5
concludes the paper with some points of further interest.



2 Strang Canonical Matrices 

This section performs three tasks. First, it outlines the general "parallel bubblesort" algorithm.
Second, it introduces two classes of permutation matrices: Strang canonical matrices and settled matrices.
Third, Theorem 4, the main result of this section, establishes that Conjecture 1 holds for tracefree
settled Strang canonical matrices with so-called reducing matrix factors.



Definition 3. If P is an n x n permutation matrix, then the mth row of P, 1 ≤ m ≤ n, is row_m(P),
the mth column of P is col_m(P) and [col_1(P) ... col_n(P)] := (Px)' where x' = [1 ... n]. In a
numeric context, col_m(P) therefore denotes the column index of the 1 in row_m(P).

If 1 ≤ i < j ≤ n, row_i(P) and row_j(P) are an inverted pair if col_i(P) > col_j(P) and are a
contented pair otherwise.

If $P = \prod_{k=1}^{m} T_k$, where T_k are bandwidth-1 permutation matrices, then fact(P) := m.

If an n x n permutation matrix P represents a permutation σ, then col_i(P) = σ(i) for 1 ≤ i ≤ n,
and each inverted pair of P represents an inversion of σ. Also, band(P) = max_{1≤i≤n} |col_i(P) - i|.
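The quantities of Definition 3 are easy to compute once a permutation matrix is stored by its vector [col_1(P) ... col_n(P)]. The sketch below uses hypothetical helper names and 1-based column indices to match the text.

```python
def col_vector(P):
    """[col_1(P), ..., col_n(P)] = (Px)' for x' = [1, ..., n]."""
    n = len(P)
    return [sum(P[i][j] * (j + 1) for j in range(n)) for i in range(n)]


def inverted_pairs(cols):
    """All (i, j) with 1 <= i < j <= n and col_i > col_j."""
    n = len(cols)
    return [(i + 1, j + 1) for i in range(n) for j in range(i + 1, n) if cols[i] > cols[j]]


def band(cols):
    """band(P) = max over i of |col_i(P) - i|."""
    return max(abs(c - i) for i, c in enumerate(cols, start=1))


P = [[0, 0, 1],
     [1, 0, 0],
     [0, 1, 0]]
cols = col_vector(P)
print(cols, inverted_pairs(cols), band(cols))
```

Working on the vector `cols` rather than on the full matrix turns a row swap of P into a swap of two list entries, which is the representation assumed in any later sketches of this kind.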

Remark 4. The bubblesort algorithm [3, p. 40] shows that each permutation σ is the product of
transpositions of adjacent elements, $\sigma = \prod \tau_i$. Let P_ρ be the permutation matrix representing ρ. Then
P_{τ_i} are all bandwidth-1 permutation matrices, and $P_\sigma = \prod P_{\tau_i}$. Thus, a permutation matrix can
always be written as a product of bandwidth-1 permutation matrices.

Let $P = \prod_{k=1}^{m} T_k$ be a permutation matrix where T_k are bandwidth-1 permutation matrices. Since
permutation matrices are unitary and bandwidth-1 permutation matrices are involutions,

$$P^{-1} = \Big(\prod_{k=1}^{m} T_k\Big)^{-1} = \prod_{k=1}^{m} T_{m+1-k}^{-1} = \prod_{k=1}^{m} T_{m+1-k} = P'.$$

Thus, fact(P) = fact(P^{-1}). Whenever such a product is defined, denote the indexed matrices P_0 = P
and $P_k = T_k P_{k-1} = \big(\prod_{i=1}^{k} T_{k+1-i}\big) P$.

Lemma 2. If P = P_0 is a finite permutation matrix, and P_k = B_k P_{k-1} where B_k is a nonidentity
permutation matrix that performs swaps only on inverted pairs of P_{k-1}, then there is a number m such
that P_m = I.

Proof. Since B_k makes some inverted pairs of P_{k-1} contented, the number of inverted pairs of P_k is
less than that of P_{k-1}. Since the number of inverted pairs of P is finite, there must be a number m
such that P_m = I. □

When B_k only swaps adjacent rows, Lemma 2 describes "parallel bubblesorting", as each bubblesort
iteration reduces the number of inversions of a permutation, and $P = \prod_{k=1}^{m} B_k$ is the required
decomposition. The rest of the paper investigates the greedy selection operation {B_k} to ensure that
m = fact(P) < 2w where w = band(P).
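A minimal sketch of "parallel bubblesorting" in the sense of Lemma 2 follows. It picks each B_k by a simple top-down scan over adjacent inverted pairs, which is only one illustrative choice and not the selection rule developed later in the paper. The function names are hypothetical, and a matrix is stored as its vector of col values.

```python
def parallel_bubblesort(cols):
    """Return the rounds B_1, ..., B_m as lists of 1-based upper-row indices of the swaps."""
    cols = list(cols)
    rounds = []
    while any(cols[i] > cols[i + 1] for i in range(len(cols) - 1)):
        swaps, i = [], 0
        while i < len(cols) - 1:
            if cols[i] > cols[i + 1]:            # adjacent inverted pair of P_{k-1}
                cols[i], cols[i + 1] = cols[i + 1], cols[i]
                swaps.append(i + 1)
                i += 2                           # keep the swaps of one round disjoint
            else:
                i += 1
        rounds.append(swaps)
    return rounds


def apply_rounds(cols, rounds):
    """Left-multiply by each round in turn (each round is a set of disjoint row swaps)."""
    cols = list(cols)
    for swaps in rounds:
        for i in swaps:
            cols[i - 1], cols[i] = cols[i], cols[i - 1]
    return cols


rounds = parallel_bubblesort([3, 4, 1, 2])
print(rounds)
```

Because every round is an involution, applying the rounds in reverse order to the identity recovers P, which is the statement P = B_1 B_2 ... B_m in list form.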

Given $P = \prod_{k=1}^{m} B_k$ as in Lemma 2, for 0 ≤ i, j ≤ m, row_{m_i}(P_i) = row_{m_j}(P_j) if
col_{m_i}(P_i) = col_{m_j}(P_j); that is, rows of different P_k are identified by the column of their 1.

Definition 5. Treating P as a block diagonal matrix with the finest partition, each diagonal block is
called a section of P. A 1 x 1 section is trivial.

A row of a permutation matrix is said to be positive (negative, neutral) if the 1 is to the right of
(to the left of, on, respectively) the diagonal. A column of a permutation matrix is said to be positive
(negative, neutral) if the 1 is above (below, on, respectively) the diagonal.

Each section S_1, ..., S_k of a permutation matrix P is a permutation matrix, and fact(P) =
max{fact(S_1), ..., fact(S_k)}. Each nontrivial section has a positive top row, a negative bottom row,
a negative leftmost column and a positive rightmost column.

Remark 6. Each 1 on the diagonal of a permutation matrix P determines a neutral row and column.
Observe that the number of 1s in the upper triangle of P is the sum of the number of its positive and
neutral rows, and the sum of the number of its positive and neutral columns. Thus, P has the same
number of positive rows and columns. Observing the number of 1s in the lower triangle of P similarly
shows that P has the same number of negative rows and columns.

Figure 1: Signs of rows and columns of P



Definition 7. Two permutation matrices are row-sign-equivalent (column-sign-equivalent) if the signs
of their rows (columns) are the same.

A section is row-settled if all of its positive rows are above its negative rows, column-settled if all
of its negative columns are to the left of its positive columns and settled if it is either row-settled or
column-settled.

A row-settled (column-settled) matrix has only row-settled (column-settled) sections, and a settled
matrix is either row-settled or column-settled.

A section is upper-canonical (lower-canonical), or in upper-canonical form (lower-canonical form),
if its positive (negative) rows are pairwise contented. A section is Strang canonical (half-canonical),
or in Strang canonical form (half-canonical form), if it is in both (either) upper-canonical and (or)
lower-canonical form.

A permutation matrix is upper-canonical (lower-canonical), or in upper-canonical form (in
lower-canonical form), if all of its sections are in upper-canonical (lower-canonical) form. A permutation
matrix is Strang canonical (half-canonical), or in Strang canonical (half-canonical) form, if it is in
both (either) upper-canonical and (or) lower-canonical form.

The inverse of a row-settled matrix is a column-settled matrix, and vice versa. The inverse of an
upper-canonical matrix is a lower-canonical matrix, and vice versa.

If U is an upper-canonical matrix and the 1s in its upper triangle excluding its diagonal are in rows
p_1, ..., p_k such that p_i < p_{i+1}, then col_{p_i}(U) < col_{p_{i+1}}(U) for 1 ≤ i < k. If L is a lower-canonical
matrix and the 1s in its lower triangle excluding its diagonal are in rows n_1, ..., n_l such that n_i < n_{i+1},
then col_{n_i}(L) < col_{n_{i+1}}(L) for 1 ≤ i < l.

Example 8. The only n x n, n ≤ 4, sections that are not in Strang canonical form are
[three matrices omitted]
The eight 4 x 4 tracefree settled sections are the above three matrices and
[five matrices omitted]

Remark 9. A Strang canonical matrix is uniquely determined by the signs of its rows and columns.
The mth 1 on its diagonal is at the intersection of its mth neutral row and mth neutral column. The
mth 1 in its upper triangle, excluding the diagonal, is at the intersection of its mth positive row and
mth positive column. The mth 1 in its lower triangle, excluding the diagonal, is at the intersection of
its mth negative row and mth negative column.
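The classes of Definitions 5 and 7 can be enumerated directly for small n, which gives a way to recompute the counts quoted in Example 8. The sketch below stores each matrix as its vector of col values; all function names are hypothetical, and the test for a section assumes that the finest block partition cuts wherever the first i rows use exactly the first i columns.

```python
from itertools import permutations


def is_section(cols):
    """True if no proper leading block of rows is closed under the permutation."""
    return all(max(cols[:i]) > i for i in range(1, len(cols)))


def row_signs(cols):
    """+1 for a positive row, -1 for a negative row, 0 for a neutral row."""
    return [(c > i) - (c < i) for i, c in enumerate(cols, start=1)]


def col_signs(cols):
    """+1 if the 1 of a column is above the diagonal, -1 if below, 0 if on it."""
    row_of = {c: i for i, c in enumerate(cols, start=1)}
    return [(j > row_of[j]) - (j < row_of[j]) for j in range(1, len(cols) + 1)]


def is_strang_canonical(cols):
    """Positive rows pairwise contented and negative rows pairwise contented."""
    signs = row_signs(cols)
    pos = [c for c, s in zip(cols, signs) if s > 0]
    neg = [c for c, s in zip(cols, signs) if s < 0]
    return pos == sorted(pos) and neg == sorted(neg)


def is_settled(cols):
    rows = [s for s in row_signs(cols) if s]
    columns = [s for s in col_signs(cols) if s]
    return rows == sorted(rows, reverse=True) or columns == sorted(columns)


non_canonical = [p for n in range(1, 5) for p in permutations(range(1, n + 1))
                 if is_section(p) and not is_strang_canonical(p)]
tracefree_settled = [p for p in permutations(range(1, 5))
                     if is_section(p) and is_settled(p)
                     and all(c != i for i, c in enumerate(p, start=1))]
print(non_canonical)
print(len(tracefree_settled))
```

Under these assumptions the enumeration finds three sections of size at most 4 that are not Strang canonical and eight tracefree settled 4 x 4 sections, in agreement with the counts stated in Example 8.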

Definition 10. A reducing swap of a permutation matrix P is an elementary matrix that swaps
adjacent rows of P where the upper row is positive and the lower row is negative. The pair of rows are
reduced by the swap. The reducing matrix R of a permutation matrix P is the product of all possible
reducing swaps of P, and the reduction of P is RED(P) := RP.

Each reducing swap makes an inverted pair contented. Distinct reducing swaps of P act on disjoint
pairs of rows, so if P has at least one reducing swap, then band(R) = 1.

Example 11. In Example 2, P = R_1 R_2, where R_1 is the reducing matrix of P and RED(P) = P_1 =
R_1 P = R_2.
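Definition 10 can be sketched in a few lines when P is stored as its vector of col values. The names `reducing_swaps` and `reduction` are hypothetical; the second one plays the role of RED(P).

```python
def reducing_swaps(cols):
    """1-based indices m such that row_m is positive and row_{m+1} is negative."""
    return [m for m in range(1, len(cols)) if cols[m - 1] > m and cols[m] < m + 1]


def reduction(cols):
    """RED(P) = RP: perform every reducing swap of P at once."""
    out = list(cols)
    for m in reducing_swaps(cols):
        out[m - 1], out[m] = out[m], out[m - 1]
    return out


P = [3, 4, 1, 2]
P1 = reduction(P)
P2 = reduction(P1)
P3 = reduction(P2)
print(P1, P2, P3)
```

The matrix with col vector [3, 2, 1] has a neutral middle row and no reducing swap at all, which illustrates why the tracefree condition appears in the results that use reducing matrices only.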

Lemma 3. If P is a Strang canonical matrix whose nontrivial sections are tracefree, then
$P = \prod_{k=1}^{m} R_k$, where R_k is the reducing matrix of P_{k-1}.

Proof. The permutation matrix P and RED(P) have the same Strang canonicity, since reducing swaps
only exchange the positions of a positive row and a negative row, so each P_k is Strang canonical.

Since the nontrivial sections of P = P_0 are tracefree, the only neutral rows of P are in trivial
sections. Let

$$P_k = \begin{bmatrix} A_k & 0 & B_k \\ 0 & 1 & 0 \\ C_k & 0 & D_k \end{bmatrix},\ k > 0, \quad \text{and} \quad P_{k-1} = \begin{bmatrix} A_{k-1} & B_{k-1} \\ C_{k-1} & D_{k-1} \end{bmatrix}.$$

The indicated neutral row of P_k is in a trivial section if B_k and C_k are zero matrices. Let A_k be
(m - 1) x (m - 1) and row_m(P_{k-1}) be signed.

If row_m(P_{k-1}) is negative, let A_{k-1} be m x m. Since P_{k-1} is Strang canonical, every negative
row_{m'}(P_{k-1}) with m' < m has its 1 in A_{k-1}. Then row_{m-1}(P_{k-1}) is positive with col_{m-1}(P_{k-1}) = m
and every positive row_{m'}(P_{k-1}) with m' < m has its 1 in A_{k-1}. Since each neutral row_{m'}(P_{k-1}) with
m' < m has its 1 in A_{k-1}, both B_{k-1} and C_{k-1} are zero matrices, so B_k and C_k are zero matrices.

If row_m(P_{k-1}) is positive, let A_{k-1} be (m - 1) x (m - 1), and it similarly follows that B_{k-1}, C_{k-1},
B_k and C_k are zero matrices.

Thus, P and RED(P) are Strang canonical matrices with tracefree nontrivial sections, and the
conclusion follows from Lemma 2. □

Theorem 4. A tracefree settled Strang canonical matrix of bandwidth w can be written as the product
of less than 2w bandwidth-1 matrices.

Proof. Let P be a row-settled Strang canonical matrix with tr(P) = 0 and band(P) = w. Thus, it has
an n x n row-settled Strang canonical section S with tr(S) = 0 and band(S) = w. Let col_m(S) = n.
Then the upper m rows of S are positive, and the rest are negative, with col_{m+1}(S) = 1. From
Lemma 3, a row-settled Strang canonical section is the product of reducing matrices: once a row is
reduced, it is reduced by the next reducing matrix, until it is in a trivial section. Hence,

$$\mathrm{fact}(S) = \max_{1 \le i \le m} \{\mathrm{col}_i(S) + m - 2i\} = \max_{m < i \le n} \{2i - m - 1 - \mathrm{col}_i(S)\}. \qquad (*)$$

Here |col_i(S) - i| ≤ w is the swap count, which is the number of reducing swaps for row_i(S) to be
placed in the col_i(S)th row, and m - i, if i ≤ m (i - (m + 1), if i > m), is the delay count, which is
the number of positive (negative) rows that must be reduced before the positive (negative) row_i(S) is
first reduced.

Since col_m(S) = n, S has n - m = col_m(S) - m ≤ w negative rows; since col_{m+1}(S) = 1, S has
m = m + 1 - col_{m+1}(S) ≤ w positive rows. Thus, if band(S) = w, then fact(S) ≤ w - 1 + w = 2w - 1.

Since a column-settled Strang canonical matrix P is the inverse of a row-settled Strang canonical
matrix P^{-1} and, by Remark 4, fact(P) = fact(P^{-1}), the conclusion follows. □
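Equation (*) adds a swap count and a delay count for each row, and it can be compared with the number of reductions actually needed on small examples. The sketch below assumes its input is the col vector of a tracefree row-settled Strang canonical section; the function names are hypothetical.

```python
def reduction(cols):
    """Perform every reducing swap (positive row directly above a negative row)."""
    out = list(cols)
    for m in range(1, len(cols)):
        if cols[m - 1] > m and cols[m] < m + 1:
            out[m - 1], out[m] = out[m], out[m - 1]
    return out


def reduction_steps(cols):
    """Number of reductions needed to reach the identity."""
    steps, cols = 0, list(cols)
    while cols != sorted(cols):
        nxt = reduction(cols)
        if nxt == cols:
            raise ValueError("no reducing swap available")
        cols, steps = nxt, steps + 1
    return steps


def fact_formula(cols):
    """Both maxima of (*): over the positive rows and over the negative rows."""
    n = len(cols)
    m = cols.index(n) + 1                      # col_m(S) = n
    over_positive = max(cols[i - 1] + m - 2 * i for i in range(1, m + 1))
    over_negative = max(2 * i - m - 1 - cols[i - 1] for i in range(m + 1, n + 1))
    return over_positive, over_negative


for S in ([3, 4, 1, 2], [2, 4, 1, 3], [2, 3, 4, 1]):
    print(S, fact_formula(S), reduction_steps(S))
```

On these three sections the two maxima agree with each other and with the number of reductions, as (*) states.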

Remark 12. If C is an n x n circulant nonidentity permutation matrix, then it is Strang canonical,
row-settled and column-settled, with tr(C) = 0 and fact(C) = n - 1, all reducing matrices. Moreover,
for some k, 1 < k ≤ n, it follows that col_m(C) = ((k + m - 2) mod n) + 1,

$$\mathrm{band}(C) = \max\{\mathrm{col}_1(C) - 1,\ n - \mathrm{col}_n(C)\} = \begin{cases} k - 1, & \text{if } k - 1 \ge n/2, \\ n - k + 1, & \text{if } k - 1 \le n/2, \end{cases}$$

col_k(C) is its first positive column and row_{n-k+2}(C) is its first negative row. fact(C) is tight with
Strang's bound if n = 2w and w = band(C).
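The statements of Remark 12 can be checked numerically for small circulants. The sketch below builds C from the formula col_m(C) = ((k + m - 2) mod n) + 1 and counts how many reductions bring it to the identity; the helper names are hypothetical, and the count only shows that n - 1 reducing matrices suffice, not that fewer factors are impossible.

```python
def circulant(n, k):
    """col vector of the n x n circulant with col_1(C) = k, for 1 < k <= n."""
    return [((k + m - 2) % n) + 1 for m in range(1, n + 1)]


def band(cols):
    return max(abs(c - i) for i, c in enumerate(cols, start=1))


def reduction(cols):
    """Perform every reducing swap (positive row directly above a negative row)."""
    out = list(cols)
    for m in range(1, len(cols)):
        if cols[m - 1] > m and cols[m] < m + 1:
            out[m - 1], out[m] = out[m], out[m - 1]
    return out


def reduction_steps(cols):
    steps, cols = 0, list(cols)
    while cols != sorted(cols):
        cols, steps = reduction(cols), steps + 1
    return steps


C = circulant(4, 3)
print(C, band(C), reduction_steps(C))
```

For n = 4 and k = 3 the bandwidth is w = 2 and three reductions are used, which is the tight case n = 2w mentioned above.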



Corollary 5. Let C be a circulant matrix, R be a row-settled Strang canonical matrix which is
row-sign-equivalent to C and P be a column-settled Strang canonical matrix which is column-sign-equivalent
to C. Then fact(C) ≥ fact(R), fact(P).

Proof. From Theorem 4, for each row of R and C with the same index, the delay count is the same,
but the swap count is maximum for C. So, from Equation (*), fact(C) ≥ fact(R). Since P^{-1} is
row-settled and Strang canonical and C^{-1} is circulant, fact(C) = fact(C^{-1}) ≥ fact(P^{-1}) = fact(P)
by Remark 4. □



3 Overtaking Swaps 

Theorem 6, the main result in this section, establishes that Conjecture 1 holds for settled Strang
canonical matrices by a comparison with tracefree matrices.

Remark 13. Since the neutral rows of a permutation matrix P = P_0 are pairwise contented, if
P_k = B_k P_{k-1} is as in Lemma 2 and all the signed rows of P are neutral in P_m, then P_m = I.

Definition 14. An overtaking swap of a permutation matrix P is an elementary matrix that swaps an 
adjacent inverted pair of P where either the upper row is not positive or the lower row is not negative. 
The upper positive row or the lower negative row overtakes the other row by the swap. 



If $P = \begin{bmatrix} A & B \\ C & D \end{bmatrix}$, where A is (m - 1) x (m - 1), define

$$\mathrm{INS}_m(P) := \begin{bmatrix} A & 0 & B \\ 0 & 1 & 0 \\ C & 0 & D \end{bmatrix}$$

and INS_{m_1,...,m_k}(P) := INS_{m_k}(...(INS_{m_1}(P))).

If $P = \begin{bmatrix} A & 0 & B \\ 0 & 1 & 0 \\ C & 0 & D \end{bmatrix}$, where A is (m - 1) x (m - 1), define
$\mathrm{DEL}_m(P) := \begin{bmatrix} A & B \\ C & D \end{bmatrix}$
and DEL_{m_1,...,m_k}(P) := DEL_{m_k}(...(DEL_{m_1}(P))).

If P is a nonidentity permutation matrix whose neutral rows have indices r_1, ..., r_k, r_i > r_{i+1},
ESS(P) := DEL_{r_1,...,r_k}(P) is the essential form or essence of P.

Remark 15. If P is an n x n permutation matrix and 1 ≤ m ≤ n, then band(INS_m(P)) ≤ band(P) + 1.
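The insertion and deletion of neutral rows can be sketched on col vectors. The code assumes that INS_m places a new neutral row and column at index m and shifts the later rows and columns by one, and that DEL_m undoes this; the names `ins`, `delete` and `ess` are hypothetical.

```python
from itertools import permutations


def band(cols):
    return max(abs(c - i) for i, c in enumerate(cols, start=1))


def ins(cols, m):
    """INS_m: insert a neutral row (and column) at 1-based index m."""
    shifted = [c + 1 if c >= m else c for c in cols]
    return shifted[:m - 1] + [m] + shifted[m - 1:]


def delete(cols, m):
    """DEL_m: remove the neutral row (and column) at 1-based index m."""
    if cols[m - 1] != m:
        raise ValueError("row m is not neutral")
    rest = cols[:m - 1] + cols[m:]
    return [c - 1 if c > m else c for c in rest]


def ess(cols):
    """ESS: delete all neutral rows, largest index first."""
    neutral = [i for i, c in enumerate(cols, start=1) if c == i]
    for r in sorted(neutral, reverse=True):
        cols = delete(cols, r)
    return cols


T1 = [2, 1]
T2 = ins(T1, 2)
T3 = ins(T2, 2)
T4 = ins(T3, 2)
print(T2, T3, T4, ess(T4))
```

Deleting from the largest index downward keeps the remaining neutral rows at their original indices, which is why ESS lists the indices in decreasing order.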

Example 16. To demonstrate the effect of inserting neutral rows on the number of factors, consider
$T_1 = \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix}$, the unique bandwidth-1 section. The factorization of T_k = INS_2(T_{k-1}), with matrices
containing overtaking swaps underlined, is as follows:

T_2 = INS_2(T_1) = [factorization omitted]

T_3 = INS_2(T_2) = [factorization omitted]

T_4 = INS_2(T_3) = O_1 O_2 R_1 R_2 R_3,

where O_1 swaps the first and last pairs of rows of T_4 and O_2 swaps the middle row of O_1 T_4.



When a neutral row is overtaken, it assumes the sign of the overtaking row. 

Theorem 6. A settled Strang canonical matrix of bandwidth w can be written as the product of less 
than 2w bandwidth-1 matrices. 

Proof. To show the result for a settled Strang canonical matrix P, a circulant matrix C will be used
to determine an upper bound for fact(P).

Let C be n x n with col_c(C) as its first positive column and row_r(C) as its first negative row. From
Remark 12, $C = \prod_{k=1}^{n-1} R_k$ where R_k is the reducing matrix of C_{k-1}.

For P = INS_m(C), let w = band(P) = band(C) + 1. If m = r, the neutral row is inserted between
the positive and negative rows. The initial reduction C_1 is delayed by an overtake of the neutral row
by row_{m'}(P). If 2m < n, the neutral row is closer to row_1(P). If 2m > n, it is closer to row_{n+1}(P).
So the neutral row is overtaken toward whichever row between row_1(P) and row_{n+1}(P) it is closer to,
or either if 2m = n, and row_{m'}(P) has the sign of n - 2m. Let the overtaking swap be O and OP
replace P.

Step 1 If row_{r'}(P) is the first negative row of P, then r' ∈ {r, r + 1}. Let k = min{r' - 1, n + 1 - r'}
and e = r' ∓ k ∈ {1, n + 1}, that is, e = r' - k = 1 when k = r' - 1 and e = r' + k = n + 1 when
k = n + 1 - r'. For the matrices O_q with 1 ≤ q ≤ k, if col_m(P_{q-1}) = m, then
$O_q = \mathrm{INS}_m(R_q)\,\tilde{O}_q$ with $\tilde{O}_q = I$ if P_{q-1} is Strang canonical, or $\tilde{O}_q$ is the overtaking swap of
row_m(P_{q-1}) otherwise.

Step 2 P_k is Strang canonical and row_e(P_k) is neutral. For q > k, O_q can be determined by showing
that P_k = INS_e(C*_{k-1}) where:

Case 1 If row_m(P_k) is neutral, then P_k = INS_{e,e}(C'_{k-1}), so C* = INS_e(C'), where C' is an
(n - 1) x (n - 1) circulant matrix.

Case 2 If m is between c and r or m = c, then P_k = INS_e(C_{k-1}), so C* = C.

Case 3 Otherwise, C* is a row-settled Strang canonical matrix which is row-sign-equivalent to
C.

Then, for k < q ≤ fact(C*), O_{q+1} = INS_e(R*_q), where R*_q is the reducing matrix of C*_{q-1} and,
from Corollary 5, fact(P) = fact(C*) + 1 ≤ n + 1.

Upon the completion of the above steps, it can be determined that $P = \prod_{k=1}^{q} O_k$ where band(O_k) = 1.
If m = r = c, then n = 2(m - 1) and band(C) = m - 1. Thus, band(P) = m = w, q = n + 1 = 2w - 1,
O_1 = O and O_{k+1} = O_k for 1 ≤ k ≤ n, where the O_k on the right-hand side are the factors determined
in Steps 1 and 2. Otherwise, q ∈ {n - 1, n} and, by Remark 12, 2(w - 1) - 1 ≥ n - 1,
making 2w - 1 ≥ n + 1 > q. Therefore, for P = INS_m(C), fact(P) < 2w, and this bound is tight only
when m = n/2 + 1.

As seen in Example 16, the parity of the number of neutral rows inserted as a block, say P =
INS_{m,...,m}(C), whether it is an odd or an even number, may affect fact(P) differently. In particular,
when m = n/2 + 1, fact(INS_{m,m}(C)) = fact(INS_m(C)) and band(INS_{m,m}(C)) = band(INS_m(C)) + 1.
For m_t, ..., m_1 such that 1 ≤ m_{i+1} < m_i ≤ n,

$$\mathrm{fact}(\mathrm{INS}_{m_1,\ldots,m_t}(C)) - \mathrm{fact}(C) = \sum_{i=1}^{t} \big(\mathrm{fact}(\mathrm{INS}_{m_i}(C)) - \mathrm{fact}(C)\big).$$

Multiple blocks of neutral rows inserted to produce P occasionally add a single bandwidth-1
factor, whenever P_q contains a neutral row between rows of the opposite sign, such as, if r' > r, for
INS_{r,r+2,r+2,r+2}(C) and for INS_{r'-r,...,r'-r}(C) where there are r + 1 neutral rows inserted.



Therefore, for any circulant nonidentity permutation matrix C, given the class 𝒞_C = {P : ESS(P) =
C}, if d_P = 2 band(P) - 1 - fact(P), then d_C ≥ 0, d_C = d_P when C is a 2m x 2m matrix and
P = INS_{m+1}(C), otherwise d_P > d_C whenever P ≠ C.

Finally, if R is a row-settled Strang canonical matrix which is row-sign-equivalent to C, from
Corollary 5 and by following the previous arguments, for every set {m_1, ..., m_t},
band(INS_{m_1,...,m_t}(R)) ≤ band(INS_{m_1,...,m_t}(C)) and fact(INS_{m_1,...,m_t}(R)) ≤ fact(INS_{m_1,...,m_t}(C)).
The argument holds for column-settled Strang canonical R^{-1}, and the conclusion follows. □

Remark 17. If P is a settled Strang canonical matrix with band(P) = w and f neutral rows, then
fact(P) < 2w - f, by the proof of Theorem 6, noting the tight-bound exception.

4 Opportunistic Overtaking 

The main result of this section is the completion of the proof of Conjecture 1 with
opportunistic-overtaking matrices, a greedy generalization of reducing matrices, along the construction
used in Theorem 6.

Definition 18. If P is a permutation matrix, then INV_m(P) is the minimal submatrix containing
only consecutive rows of P such that row_m(P) and all of the rows of P that are pairwise inverted with
row_m(P) are in INV_m(P).

An inverted block of a permutation matrix is a maximal submatrix containing only consecutive 
rows such that all the rows are pairwise inverted. 

An opportunistic-overtaking matrix O of a permutation matrix P is the product of the reducing
matrix of P and the overtaking swaps of P such that, for every inverted block of P, the only rows that
O can leave unswapped are the first and the last rows of the block. The collection of all products of P
with any of its opportunistic-overtaking matrices O is denoted by OOM(P) ∋ OP.

Given a signed row_m(P), removing its sign produces a matrix P' := FIX_m(P). If m' is such that
col_{m'}(P) = m, then: row_k(P') = row_k(P) when k ≠ m, m'; row_{m'}(P') = row_m(P); and row_m(P') is
neutral.

For every row_m(P) not in a trivial section, the top row of INV_m(P) is positive and the bottom
row of INV_m(P) is negative. If P is upper-canonical and row_m(P) is positive, it is the top row of
INV_m(P). If P is lower-canonical and row_m(P) is negative, it is the bottom row of INV_m(P).

If the rows of an n x n permutation matrix P, from row_i(P) to row_j(P), form an inverted block,
then

• either i = 1 or row_{i-1}(P) and row_i(P) are a contented pair, and

• either j = n or row_j(P) and row_{j+1}(P) are a contented pair.

If P is a nonidentity permutation matrix and O is any of its opportunistic-overtaking matrices,
then band(O) = 1 and O satisfies the "parallel bubblesort" condition of B_k from Lemma 2, while
providing a locally-optimal, i.e. greedy, condition to determine the next bandwidth-1 factor, in that
P and OP ∈ OOM(P) share no inverted pairs.

The only inverted blocks that a Strang canonical matrix has are reducible pairs and neutral rows 
with a positive and/or a negative row to overtake it. The product of any permutation matrix and any 
of its opportunistic-overtaking matrices is a permutation matrix whose inverted blocks have no more 
than three rows. 

A permutation matrix P always has a unique reducing matrix. P has a unique opportunistic- 
overtaking matrix only if each inverted block of P that has more than two rows has a reducible pair, 
otherwise that block can have two choices of overtaking swaps. 

An algorithm for determining an opportunistic-overtaking matrix of P is given in the Appendix.

Example 19. If, as in Theorem 4 and the sections in Lemma 3, P is Strang canonical and tr(P) = 0,
then its reducible pairs are inverted blocks and OOM(P) = {RED(P)}. In the proof of Theorem 6,
P_q ∈ OOM(P_{q-1}).

Theorem 7. A permutation matrix of bandwidth w can be written as the product of less than 2w 
bandwidth-1 permutation matrices. 

Proof of Conjecture 1. The proof will relax the conditions on the permutation matrix and prove that
the conjecture holds for each relaxation.

Let P be a lower-canonical matrix with band(P) = w and $P = \prod_{k=1}^{q} O_k$ where O_k P_{k-1} = P_k ∈
OOM(P_{k-1}). A negative row of P will be in a trivial section in some P_k only by being swapped by
O_k with a positive row of P. So, let row_m(P) be positive.

Case 1 If row_m(P) = row_{m_k}(P_k) is never overtaken in {O_k}, the plan is to localize the determination
of the swaps that move row_m(P) in {O_k}.

First, determine P^m by removing the signs of the positive rows that are overtaken by
row_m(P), so that P^m has no such positive row.

Next, determine the matrix $\hat{P}^m$ localizing to the swaps of row_m(P) and the rows that are
inverted with it. Then, $\hat{P}^m$ is column-settled and Strang canonical, and its only nontrivial
section is from row_m($\hat{P}^m$) to row_{ℓ_m}($\hat{P}^m$), which is row-sign-equivalent to INV_m(P^m).

By Theorem 6, $\hat{P}^m = \prod_{k=1}^{\hat{q}_m} T^m_k$. The swap in T^m_k that moves row_{m_k}($\hat{P}^m_k$) = row_m($\hat{P}^m$),
a possibly identity elementary matrix, can be performed on row_m(P^m), and
thus can also be performed on row_m(P).
There are two scenarios to consider:

Sub-Case 1 Assume that there is a reducible pair in INV_m(P) that is not in INV_m(P^m).
Since both rows have their signs removed, their reducing swap will be in O_1, and
row_m(P) will be in a trivial section in $P_{\hat{q}_m + 1}$. By Remark 17, $q_m = \hat{q}_m + 1 <
2w - f_m + 1$ with f_m ≥ 1.

Sub-Case 2 Otherwise, row_m(P) will be in a trivial section in $P_{\hat{q}_m}$. By Remark 17, $q_m = \hat{q}_m <
2w - f_m$.

Case 2 Let the mth row be overtaken in {O_k}. From the previous case, for each row_{r'}(P) overtaking
row_m(P) in {O_k}, row_{r'}(P) is in a trivial section in P_{q_{r'}}, with q_{r'} < 2w - f_{r'} where f_{r'} is the
number of nonnegative rows in INV_{r'}(P). Again, there are two scenarios to consider:

Sub-Case 1 If the final swap of row_m(P) in {O_k} is with one of the rows overtaking it, say
row_{r'}(P), then row_m(P) is in a trivial section in P_{q_m}, where q_m ≤ q_{r'} < 2w - f_{r'}.

Sub-Case 2 Otherwise, row_m(P) is positive just before it is in a trivial section, and all rows that
can overtake it have overtaken it before it is swapped into a trivial section. Thus,
after the last row overtakes it in P_k, row_m(P) = row_{m_k}(P_k) is above its overtaking
row row_{r_k}(P_k), and once row_{r_k}(P_k) swaps with a row below it, row_m(P) can swap
with the row it was overtaken by unless it was first overtaken by row_{r_k}(P_k) and is
contented with row_m(P). Then row_m(P) has a delay count trailing row_{r_k}(P_k) of
at most f_{r_k} and row_m(P) is in a trivial section in P_{q_m}, where q_m ≤ q_{r_k} + f_{r_k} < 2w.

Since fact(P) = q = max_{col_m(P) ≠ m} q_m, then q < 2w and the conjecture holds for lower-canonical
matrices. Since an upper-canonical matrix is the inverse of a lower-canonical matrix, the same conclusion
follows from Remark 4.

If P is not half-canonical, $\hat{P}^m$ can be replaced in Case 1 by a column-settled upper-canonical
matrix, where the same negative rows of $\hat{P}^m$ and P constitute an inverted pair, and the results will
similarly follow. □

If, in the above proof, P is Strang canonical, then only Sub-Case 2 of Case 1 holds for each signed
row row_m(P).

5 Further Points for Analysis 

Panova [5] proved Conjecture 1 through the use of wiring diagrams. This approach is similar to
determining multi-braids. From braid theory, by the Artin relations [2, Eq. 18, 19], braids that do not
share a thread commute: here, any number of commuting braids can be combined, without ambiguity,
into a single multi-braid. It is of interest to compare the factors derived from the approach in [5],
as with the method of Albert, Li and Yu [7, Sec. 4], which is not yet readily available, with the
opportunistic-overtaking matrices approach.

Definition 20. The distance table of an n x n permutation matrix P is dist(P) := (P - I)x where
x' = [1 ... n].

Remark 21. The total number of reducing and overtaking swaps in the factors of a permutation
matrix is the number of inverted pairs of that matrix. This number is also half the sum of absolute
values of the entries of its distance table, plus the number of rows that can overtake each signed row
and half the number of rows that can overtake each neutral row.

Given $P = \prod_{k=1}^{m} B_k$, as in Lemma 2, if the distance tables are taken as sequences, then, for the
following standard norms,

$$\|\mathrm{dist}(P_k)\|_{\ell_1} \le \|\mathrm{dist}(P_{k-1})\|_{\ell_1},$$
$$\mathrm{band}(P_k) = \|\mathrm{dist}(P_k)\|_{\ell_\infty} \le \|\mathrm{dist}(P_{k-1})\|_{\ell_\infty} = \mathrm{band}(P_{k-1}),$$
$$\|\mathrm{dist}(P_k)\|_{\ell_2} < \|\mathrm{dist}(P_{k-1})\|_{\ell_2},$$

indicating that the Manhattan and Chebychev distances cannot increase and that the Euclidean distance
always decreases.
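The behaviour of the three norms can be observed on a small example. The sketch below records the norms of dist(P_k) along one "parallel bubblesort" run, where each round swaps adjacent inverted pairs found by a top-down scan; that selection rule and the function names are assumptions made for illustration.

```python
def dist(cols):
    """dist(P) = (P - I)x, entry by entry: col_i(P) - i."""
    return [c - i for i, c in enumerate(cols, start=1)]


def norms(d):
    """(l1 norm, l-infinity norm, squared l2 norm) of a distance table."""
    return sum(abs(v) for v in d), max(abs(v) for v in d), sum(v * v for v in d)


def bubble_round(cols):
    """One round of disjoint adjacent swaps of inverted pairs, scanning from the top."""
    out, i = list(cols), 0
    while i < len(out) - 1:
        if out[i] > out[i + 1]:
            out[i], out[i + 1] = out[i + 1], out[i]
            i += 2
        else:
            i += 1
    return out


def norm_trace(cols):
    trace = [norms(dist(cols))]
    while cols != sorted(cols):
        cols = bubble_round(cols)
        trace.append(norms(dist(cols)))
    return trace


trace = norm_trace([3, 4, 1, 2])
print(trace)
```

The second component of each triple is band(P_k), and the squared Euclidean norm is used so that all entries stay integers.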

The previous remark indicates that the subproducts P_k of a given permutation matrix P are
"diffusions" of the initial state dist(P), where a "parallel bubblesort" iteration is performed in each
"time-step". This may be better analyzed if a relevant basis can be found.

Definition 22. A greedy bubble matrix G of a permutation matrix P is a product of reducing and
overtaking swaps of P such that P and GP have no common inverted pairs. The collection of all
products of P with any of its greedy bubble matrices G is denoted by GBM(P) ∋ GP.

An optimal factorization of a permutation matrix P is $\prod_{k=1}^{m} T_k = P$ where band(T_k) = 1 and each
other factorization of P into bandwidth-1 matrices cannot have fewer factors than fact(P) := m.

OOM(P) ⊆ GBM(P), but a greedy bubble matrix of P need not include reducing swaps of P.

A breadth-first spanning-tree algorithm [3, Sec. 22.2] rooted in the identity matrix applied to the
Cayley graph of the symmetric group of degree n, corresponding to the set of n x n permutation matrices,
whose connection set is the set of all permutations represented by bandwidth-1 matrices T, can be
used to determine fact(P) for any n x n permutation matrix P.
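The breadth-first search described above can be sketched as follows for small n. Each generator is a nonidentity product of disjoint adjacent row swaps, and the depth at which a matrix is first reached is its minimal number of bandwidth-1 factors. The function names are hypothetical, and the table has n! entries, so this is only practical for small n.

```python
from collections import deque


def bandwidth1_generators(n):
    """Nonidentity sets of disjoint adjacent swaps, as tuples of 0-based upper-row indices."""
    gens = []

    def extend(i, chosen):
        if i >= n - 1:
            if chosen:
                gens.append(tuple(chosen))
            return
        extend(i + 1, chosen)            # leave row i unswapped
        extend(i + 2, chosen + [i])      # swap rows i and i + 1

    extend(0, [])
    return gens


def fact_table(n):
    """Map each col vector (as a tuple) to its minimal number of bandwidth-1 factors."""
    identity = tuple(range(1, n + 1))
    gens = bandwidth1_generators(n)
    depth = {identity: 0}
    queue = deque([identity])
    while queue:
        p = queue.popleft()
        for g in gens:
            q = list(p)
            for i in g:
                q[i], q[i + 1] = q[i + 1], q[i]
            q = tuple(q)
            if q not in depth:
                depth[q] = depth[p] + 1
                queue.append(q)
    return depth


table = fact_table(4)
print(table[(3, 4, 1, 2)], table[(4, 3, 2, 1)])
```

Comparing each depth with 2w - 1, where w is the bandwidth of the matrix, gives a direct numerical check of the bound for every matrix of a given small size.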

Remark 23. The number of n x n permutation matrices of bandwidth w, w ≤ 1, is the nth Fibonacci
number, F_n, where F_0 = F_1 = 1.
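Remark 23 can be confirmed by brute force for small n. The sketch below counts the permutations whose col vector satisfies |col_i - i| ≤ 1 for every i and compares the count with the Fibonacci numbers indexed so that F_0 = F_1 = 1; the function names are hypothetical.

```python
from itertools import permutations


def count_band_at_most_1(n):
    """Number of n x n permutation matrices with bandwidth 0 or 1."""
    return sum(
        all(abs(c - i) <= 1 for i, c in enumerate(p, start=1))
        for p in permutations(range(1, n + 1))
    )


def fib(n):
    """Fibonacci numbers with F_0 = F_1 = 1."""
    a, b = 1, 1
    for _ in range(n):
        a, b = b, a + b
    return a


print([count_band_at_most_1(n) for n in range(1, 8)])
print([fib(n) for n in range(1, 8)])
```

The count includes the identity matrix, which has bandwidth 0.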



In testing n x n, n < 9, permutation matrices, the following were observed for every permutation
matrix P: there is an optimal factorization $P = \prod_{k=1}^{m} T_k$, such that P_k ∈ GBM(P_{k-1}), and fact(P) ≤ n.

The former observation suggests that a greedy algorithm [3, Ch. 16] can determine fact(P);
Remark 21 indicates that the use of greedy bubble matrices is advantageous. Further observation leads
to the following conjecture:

Conjecture 8. A finite permutation matrix of bandwidth w > 0 is the product of less than 2w greedy
bubble matrices.

Conjecture 8 asserts that, if P_0 = P and P_k ∈ GBM(P_{k-1}), then, for some m < 2w, P_m = I.

Of the tested greedy algorithms on n x n permutation matrices, n < 9, the number of factors
obtained was at most fact(P) + ⌊n/3⌋.

The latter observation seems provable from Theorem 7 where, if P is a permutation matrix,
band(P) = w, then for every signed row_m(P), INV_m(P) has at most 2w rows. A. M. Bruckstein
suggests using the sequence of adjacent transpositions to exhaustively generate all permutations of a
given length, such as the (Steinhaus-)Johnson-Trotter algorithm [8], as suggested in [4] or in [6, Table 5],
and the Artin relations [5].

D. Pasechnik suggests that the conjecture does not hold for infinite matrices.

Appendix: Opportunistic-Overtaking Matrix Algorithm

Given: a permutation matrix P

Output: an opportunistic-overtaking matrix O of P

• Initialize O = I and determine the inverted blocks of P, B_1, B_2, ..., B_k

• For each inverted block of P, B_i, 1 ≤ i ≤ k

– If B_i has a reducible pair, row_m(P) and row_{m+1}(P):
true: While row_{m-2}(P) is in B_i, set m to m - 2

false: Let row_m(P) be the top row of B_i

– While row_{m+1}(P) is in B_i, swap row_m(O) and row_{m+1}(O), then set m as m + 2
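The steps above can be sketched on col vectors as follows. The sketch assumes that an inverted block is a maximal run of consecutive rows whose col values strictly decrease, and it returns OP directly instead of building the matrix O. When a block has no reducible pair it starts pairing at the top row, as the listing does. All function names are hypothetical.

```python
def inverted_blocks(cols):
    """Maximal runs of pairwise inverted consecutive rows, as 1-based (first, last) pairs."""
    blocks, start = [], 1
    for i in range(1, len(cols) + 1):
        if i == len(cols) or cols[i - 1] < cols[i]:   # row_i and row_{i+1} are contented
            blocks.append((start, i))
            start = i + 1
    return blocks


def opportunistic_step(cols):
    """Apply one opportunistic-overtaking matrix of P and return the col vector of OP."""
    out = list(cols)
    for first, last in inverted_blocks(cols):
        m = first                                     # default: top row of the block
        for r in range(first, last):
            if cols[r - 1] > r and cols[r] < r + 1:   # reducible pair row_r, row_{r+1}
                m = r
                while m - 2 >= first:
                    m -= 2
                break
        while m + 1 <= last:
            out[m - 1], out[m] = out[m], out[m - 1]
            m += 2
    return out


def opportunistic_rounds(cols):
    """Number of opportunistic-overtaking steps needed to reach the identity."""
    rounds, cols = 0, list(cols)
    while cols != sorted(cols):
        cols, rounds = opportunistic_step(cols), rounds + 1
    return rounds


for P in ([3, 4, 1, 2], [3, 2, 1], [4, 3, 2, 1]):
    print(P, opportunistic_rounds(P))
```

Every nonidentity matrix has an inverted block with at least two rows, and each such block receives at least one swap, so the number of inverted pairs drops in every step and the loop terminates.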